Efficient Confident Search in Large Review Corpora
Abstract
Given an extensive corpus of reviews on an item, a potential customer goes through the expressed opinions and collects information in order to form an educated opinion and, ultimately, make a purchase decision. This task is often hindered by false reviews that fail to capture the true quality of the item's attributes. These reviews may be based on insufficient information or may even be fraudulent, submitted to manipulate the item's reputation. In this paper, we formalize the Confident Search paradigm for review corpora. We then present a complete search framework which, given a set of item attributes, efficiently searches a large corpus and selects a compact set of high-quality reviews that accurately captures the overall consensus of the reviewers on the specified attributes. We also introduce CREST (Confident REview Search Tool), a user-friendly implementation of our framework for anyone dealing with large review corpora. The efficacy of our framework is demonstrated through a rigorous experimental evaluation.
Related resources
Effective Browsing of Image Search Results with Visual and Diverse Summarization through Clustering
With unprecedented growth in the production of digital images and the use of multimedia references, the need for image and subject search has increased. Systematic processing of this information is a basic prerequisite for its effective analysis, organization, and management. Likewise, large collections of images have been made available on the Web and many search engines have provided the poss...
Sary: Reusable Components and Tools for Searching Large Corpora
Since corpus-based natural language processing has to deal with large corpora, efficient searching of those corpora is a necessity. For example, one might want to examine how a word or a phrase is used in a large corpus, or to collect the frequencies of all terms in it. Our system Sary solves these problems by providing fast full-text search facilities for a single lar...
Information Retrieval and Large Text Structured Corpora
Conventional Information Retrieval Systems (IRSs), also called text indexers, deal with plain text documents or ones with a very elementary structure. These kinds of systems are able to solve queries very efficiently, but they cannot take into account tags that mark different sections, or at best this capability is very limited. In contrast with this, nowadays, documents which are part o...
Expressive and Efficient Retrieval of Symbolic Musical Data
The ideal content-based musical search engine for large corpora must be both expressive enough to meet the needs of a diverse user base and efficient enough to perform queries in a reasonable amount of time. In this paper, we present such a system, based on an existing advanced natural language search engine. In our design, musically meaningful searching is simply a special case of more general...
Effect of Bone Borne Expansion and Tooth Borne Palatal Expansion on Airway Volume: A Review Article
Background and purpose: Transverse problems in the maxilla (high-arched, narrow hard palates) can cause respiratory disorders. Palatal expansion can be helpful in such cases. The present study aimed at evaluating the effect of bone borne expansion and tooth borne palatal expansion on airway volume. Materials and methods: A review study was performed by searching Google Scholar, Scopus, PubMed, Em...